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Recently, a lot of attention has been devoted to finding physically 
realisable operations that realise as closely as possible certain de- 
sired transformations between quantum states, e.g. quantum cloning, 
teleportation, quantum gates, etc. Mathematically, this problem boils 
down to finding a completely positive trace-preserving (CPTP) lin- 
ear map that maximizes the (mean) fidelity between the map itself 
and the desired transformation. In this note we want to draw atten- 
tion to the fact that this problem belongs to the class of so-called 
semidefinite programming (SDP) problems. As SDP problems are 
convex, it immediately follows that they do not suffer from local op- 
tima. Furthermore, this implies that the numerical optimization of 
the CPTP map can, and should, be done using methods from the 
well-established SDP field, as these methods exploit convexity and 
are guaranteed to converge to the real solution. Finally, we show how 
the duality inherent to convex and SDP problems can be exploited to 
prove analytically the optimality of a proposed solution. We give 
an example of how to apply this proof method by proving the opti- 
mality of Hardy and Song's p roposed solution for the universal qubit 
0-shifter (|quant-ph/0102100 



The sum in this equation must be an integral with an appropri- 
ate measure for k if k is continuous. In terms of the operator 
X, the fidelity is given by 
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The basic problem considered by a number of authors 
is: what physically realisable quantum operation 
comes closest to a given, but potentially unphysical, transfor- 
mation between quantum states? The operation is most gener- 
ally described by a linear map $; the physical readability re- 
quires that the map is completely positive and trace-preserving 
(CPTP). The desired transformation can be specified in a num- 
ber of ways, for example by enumerating all possible input- 
output pairs of pure states {|in, k), |out, k)}. The dimensions 
of the input and output Hilbert spaces, Tti n and 7i out , denoted 
di and d-i, respectively, can in general be different. The sym- 
bol k labels the different pairs and can either be discrete or 
continuous. 

In the most commonly used formalism, the CPTP map $ 
that is to implement the transformation is represented by an 
operator X acting on the Hilbert space Tim <8> Hout- The re- 
quirements of complete positivity and trace preservation result 
in the constraints 

X > 
Tr out X = 11 i n . 

The requirement that the map must implement the transfor- 
mation as closely as possible can be quantified by the mean 
fidelity F: 

F = ^(out,k|$(|in,fe)(in,fe|)|out,k). 
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F = TrXR, 



(|in, fc)(in, k\) T <8> |out,k) (out,k| 



The great virtue of this measure-of-goodness of the map is 
that the fidelity is linear in the operator X. In this way the 
problem has been formulated as an optimization problem: 

maximize Tr XR 
(P): \ X > 

Tr 0Ut X = lj n 

In general, optimization problem (P) cannot be solved analyt- 
ically and one must resort to numerical methods. Most au- 
thors try to solve (P) using ad-hoc iteration schemes involving 
Lagrange multipliers. Using these schemes, various useful 
results have been obtained. However, in our view, the con- 
vergence properties of these schemes are questionable, as it 
has not been proved that the solution obtained is actually the 
global optimum. In fact, these methods reportedly get stuck 
now and then in suboptimal local optima 

In this note we wish to draw attention to the fact that prob- 
lem (P) belongs to a well-studied class of optimization prob- 
lems called semidefinite programs (SDP). The importance of 
this fact cannot be overestimated. First of all, semidefinite 
programs are a subclass of the class of convex optimization 
problems, and convex problems have the very desirable prop- 
erty that a local optimum is automatically a global optimum. 
Keeping this in mind we see that the reported presence of lo- 
cal optima in the above iteration schemes is due to the scheme 
itself, and not to the problem being solved. 

Secondly, very efficient numerical methods have been de- 
vised to solve SDPs, as these problems occur over and over 
again in various engineering disciplines, operations research, 
etc. These methods have very good convergence properties, 
and, moreover, they yield numerical intervals within which the 
solution must lie. Using a sufficient number of iterations, the 
width of this interval can be made arbitrarily small (apart from 
numerical errors and given the validity of some technical re- 
quirements). In other words: convergence to the real solution 
is almost always guaranteed. This is to be contrasted with or- 
dinary methods, which typically yield one outcome only, and 
it is difficult to know how far its value is removed from the 



1 



real solution, especially when the optimization problem has 
multiple local optima. 

Thirdly, the way in which these numerical methods work 
can be exploited to prove analytically that a given proposed 
solution, e.g. an analytical Ansatz based on an educated guess 
and on the outcome of numerical experiments, is actually the 
correct solution. 

In the rest of this section we will first discuss the basic 
mathematical facts of semidefinite programming and then ap- 
ply them to the problem at hand. For a short introduction to 
the subject, we refer to [Qj, and for an in-depth treatment to 
|^]. Note that presents another application of SDP to quan- 
tum mechanics, namely to finding bounds on the distillable 
entanglement of mixed bipartite quantum states. 

The basic SDP problem is the minimization of a linear func- 
tion of a real variable x E R m , subject to a matrix inequality: 

minimize c T x 

F(x) = F + YZi x i p i > 

where the >-sign means that F(x) is positive semidefinite 
(hence the term SDP). The problem data are the vector c 6 
R m and the m + 1 real symmetric matrices Fi . Alternatively, 
the Fi can also be complex Hermitean but this is an atypical 
formulation within the SDP community (in engineering one 
typically deals with real quantities). 

This problem is called the primal problem. Vectors x that 
satisfy the constraint F(x) > are called primal feasible 
points, and if they satisfy F(x) > they are called strictly 
feasible points. The minimal objective value c T x is by con- 
vention denoted as p* (no complex conjugation!) and is called 
the primal optimal value. 

Of paramount importance is the corresponding dual prob- 
lem, associated to the primal one: 

maximize — Tr FqZ 

Z>0 

Tr FiZ — a, i= l..m 

Here the variable is the real symmetric (or Hermitean) matrix 
Z, and the data c, Fi are the same as in the primal problem. 
Correspondingly, matrices Z satisfying the constraints are 
called dual feasible (or strictly dual feasible if Z > 0). The 
maximal objective value — Tr FqZ, the dual optimal value, is 
denoted as d* . 

The objective value of a primal feasible point is an upper 
bound on p* , and the objective value of a dual feasible point 
is a lower bound on d*. The main reason why one is interested 
in the dual problem is that one can prove that, under relatively 
mild assumptions, p* = d*. This holds, for example, if either 
the primal problem or the dual problem are strictly feasible, 
i.e. there either exist strictly primal feasible points or strictly 
dual feasible points. If this or other conditions are not ful- 
filled, we still have that d* < p* . Furthermore, when both the 
primal and dual problem are strictly feasible, one proves the 
following optimality condition on x: x is optimal if and only 
if x is primal feasible and there is a dual feasible Z such that 



ZF(x) = 0. This latter condition is called the complementary 
slackness condition. 

In one way or another, numerical methods for solving SDP 
problems always exploit the inequality d < d* < p* < p, 
where d and p are the objective values for any dual feasible 
point and primal feasible point, respectively. The difference 
p — d is called the duality gap, and the optimal value p* is 
always "bracketed" inside the interval [d,p\. These numeri- 
cal methods try to minimize the duality gap by subsequently 
choosing better feasible points. Under the requirements of the 
above-mentioned theorem, the duality gap can be made arbi- 
trarily small (as far as numerical precision allows). This is 
precisely the reason why one should be happy when an opti- 
mization problem turns out to be an SDP problem. 

We now apply these generalities to our problem at hand. 
Problem (P) can immediately be rewritten as a (primal) SDP 
problem by noting that the set of Hermitean matrices form 
a real vector space of dimension the square of the matrix di- 
mension. Since we are dealing with matrices over the bipartite 
Hilbert space H m <8> Tiout it is convenient to choose the basis 
vectors of the matrix space accordingly. Let {u J } and {r fe } be 
orthogonal bases for Hermitean matrices over Ti. m and H ou t, 
respectively, then {cr? ® r k } forms an orthogonal basis for 
Tim <8> Hout- Furthermore, choose the bases so that both cr° 
and t° are the identity matrix (of appropriate dimension) and 
all other er^ and r k are traceless Hermitean matrices. An ob- 
vious choice would be the set of Pauli matrices {a x , a y , a z } 
or generalisations thereof to higher dimensions. We thus have 
the following parameterisation of the matrix X: 

d\-\d%-\ 

x = Y x i k<j3 ® rk - 

3=0 k=0 

With this parameterisation, the TP requirement can be ex- 
pressed in a straightforward way. The condition Tr out X = 
tj n = ct° is fulfilled if and only if Xjo = for all j > 0, and 
xoo = 1/^2- By changing the parameterisation of X, this can 
be taken care of implicitly: 

+l/d 2 . 

From this parameterisation, and the additional requirement 
X > 0, it immediately follows that the matrices Fi (in the 
SDP problem) are given by 

F = t/d 2 
F"c = cr j ® r k , with fc ^ 0. 

The index "i" in the left-hand side refers to the i of the SDP 
problem, and corresponds to all possible pairs (j, k) of right- 
hand side indices with fc ^ 0. As a shorthand for summation 
over all these pairs we will use the symbol k- 

Finally, we can assign values to the vector coefficients c; 
as follows. The fidelity F is to be maximized, so we need an 
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additional minus sign; furthermore, in terms of Xjk, F equals 

where we have used the fact that Tri? = 1. This yields for 
the coefficients cf. 

c v = -Tr{a j ® T k R), 

and for the optimal fidelity, in terms of the primal optimal 
value: 

= -P* + l/d 2 . 

Using these expressions for the vector c and the matrices 
Fi (which are only dependent on the dimensions of the prob- 
lem!), one can go about solving the problem (P) numerically. 
As some of the Fi are complex, one has to use SDP software 
that explicitly allows complex entries (e.g. [[)]]). 

Using the above assignments, the dual problem can now 
be formulated in a rather nice way. The dual objective, to be 
maximized over all Z > 0, is 

d = -TtFqZ = -Tr Z/d 2 . 

The constraint Tr FiZ = Cj gets an interesting form: 

Tr(V' <g> T k (Z + R)) = 0, with k ^ 0. 

As Z and R are both Hermitean, this means that the matrix 
Z + R must be of the form Z + R = ao 1 + J2j^o a i a ^ ® ^' 
or, in other words, 

Z = a l + A®1 - R, 

with A a traceless Hermitean matrix. With this parameterisa- 
tion for all dual feasible Z, the dual objective becomes 

d = -dido + l/d 2 . 

Maximizing d thus amounts to minimizing ao over all trace- 
less Hermitean matrices A such that the resulting Z is still 
positive semidefinite. From the parameterisation of Z one 
sees that the smallest feasible value of ao for a fixed matrix 
A is given by 

a (A) = -A min (A® 1 -R), 

where A m ; n signifies the minimal eigenvalue of the matrix. 
The dual problem finally becomes: find the optimal traceless 
Hermitean matrix A such that this ao(A) is minimal. The dual 
optimal value is then 

d* = — d\ minao(A) + l/aV 
A 

Note that we have significantly reduced the number of un- 
known parameters: from (dia^) 2 for Z to df — 1 for A. 



These expressions for the primal and dual problem can be 
used for proving that a certain proposed solution is optimal. 
To that purpose one needs to propose primal and dual feasible 
points x and A; if the resulting primal and dual objective val- 
ues p and d turn out to be equal to each other, then x and A are 
optimal feasible points and p = d = p* = d* . Alternatively, 
any feasible choice for x and A gives upper and lower bounds 
on the optimal value p*, resulting in lower and upper bounds, 
respectively, for the fidelity of problem (P). For example, set- 
ting A = gives ao(A) = A max (i?) resulting in the upper 
bound F < diX max (R), which was already derived in [ph. 

Using the method of the previous paragraph, one can test 
whether the feasible points are optimal or not, but it does not 
solve the problem of finding these points. As there is no hope 
for solving the primal and dual problems analytically for all 
but the simplest problems, one must resort to numerical meth- 
ods. Luckily, efficient methods abound and some implemen- 
tations are freely available on the web. From the numerical 
results one can then try to guess the analytical form of the so- 
lution, or at least try to propose an Ansatz containing a few 
unknown parameters. If the number of parameters is small 
they could be found by solving the primal and dual problem 
using the Ansatz. 

Even this could be relatively complicated, especially for the 
dual problem, as this is an eigenvalue problem. An alternative 
for solving the dual problem is offered by the complementary 
slackness (CS) condition, which does not require solving an 
eigenvalue equation. Supposing that a correct guess has been 
made for X of the primal problem, one then has to solve the 
linear equation 

(a l +A(g) 1 -R)X = 

in the unknowns ao and A. Of course, one then still has 
to prove that the resulting Z is dual feasible, i.e. is positive 
semidefinite, and this could still require solving an eigenvalue 
problem. 

As an example of this proof technique, we now consider the 
problem of constructing an optimal qubit ^-shifter, first con- 
sidered by Hardy and Song [|]] and prove that their "quantum 
scheme" shifter (see also [0]) is optimal. 

A qubit 0-shifter is a device that transforms a pure state 
if>(6,(p) = cos(6 l /2)|0} + exp(i0)sin(0/2)|l) into another 
pure state %p(d + a,4>). This is a non-physical operation and 
has, therefore, to be approximated. Hardy and Song consider 
both a universal approximated shifter, with fidelity indepen- 
dent of 9, and a shifter with ^-dependent fidelity optimiz- 
ing the mean fidelity. The mean fidelity of the non-universal 
shifter is better than for the universal one, but it has only been 
proven for values of a equal to integer multiples of it/ 2 that it 
has optimal mean fidelity [0]. We will now prove optimality 
for all values of a. 
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The matrix R for the shifter is given by 



R 



n r 5 

r 2 

r 3 

r 5 r 4 



with 



ri 



1/4 + c- s 
1/4 - c + s 

c 

1/4 — c — s and 
1/4 + c + s 
2c 



cos ce 
sin a. 



The Ansatz for the primal feasible point is 



X 



cos 2 /3 



cos /3 



cos/3 
sin 2 /3 



There appear to be two regimes, depending on the value of a. 

For a < ao ~ arctan(8/37r), put cos f3 = 1, and for a > ao, 
cos /3 = c/(s — c). This gives as primal objective fidelity 

F = (1 + cosa)/2, for ct < cto 

F = 1/2 + 2s + 2c 2 /(s - c), for a > a . 

Going over to the dual problem, we now present our own 
Ansatz for the dual feasible point A, which was inspired by 
numerical results: consider diagonal A only. This means that 
A is parameterised by a single number, say S, and equals A = 
5a z . This gives for Z: 



a + S - ri 

a 

~r 5 



r-2 



a 



-r5 


r 3 

ao — 5 — r4 



To prove optimality of both Ansatzes, we use the complemen- 
tary slackness condition (for finding the optimal value for ao 
and 5). The CS condition ZX = gives rise to just three 
independent equations: 

(a + 6 — ri) cos/3 — r 5 = 
(a + <5-r 2 )sin 2 /? = 
(ao — S — r^) — ?'5 cos f3 = 

As could be expected, there are two different solutions: 

a = 1/4 + 3c 

5 = -s 
cos (3=1 



and 



fa 



a = 1/4 + s + c 2 /(s 
S = —s/(s — c) 
ri) cos/3 = 7-5. 



The third equation of each set shows us that the first solution 
pertains to the case a < ao and the second solution to the 
other case. The first solution gives mean fidelity 

F = 2a = (1 + cosa)/2, 

and the second solution 

F = l/2 + 2.s + 2c 2 /(s-c). 

These values are exactly the ones obtained in the primal prob- 
lem, so this proves the optimality of our Ansatzes, provided 
Z > in both cases. It is a basic exercise in linear algebra 
to calculate the eigenvalues of Z in both cases; noting that 
< s < c in the case a < ao, and c < s in the other case, 
one can indeed show that Z is always positive semidefinite, 
proving its feasibility. 

To conclude, we have noted that the problem (P), which 
has to be solved for finding CPTP maps that optimally ap- 
proximate certain desired qubit-transformations, is a semidef- 
inite programming (SDP) problem. From this observation, it 
follows that (P) can be efficiently solved using standard SDP 
software, and that there is no need for ad-hoc solution meth- 
ods, which could suffer from bad convergence properties. Fur- 
thermore, we presented a method for proving analytically that 
an Ansatz for the solution of (P) is optimal. We hope that the 
present work will be useful for those working in the field of 
determining optimal CP maps or optimal quantum measure- 
ments. 
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